Tong Zhao

Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

May 28, 2026
Via arXiv

Turning Video Models into Generalist Robot Policies

May 27, 2026
Via arXiv

AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning

May 23, 2026
Via arXiv

ATIR: Towards Audio-Text Interleaved Contextual Retrieval

Apr 22, 2026
Via arXiv

Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation

Apr 09, 2026
Via arXiv

Semantic IDs for Recommender Systems at Snapchat: Use Cases, Technical Challenges, and Design Choices

Apr 05, 2026
Via arXiv

OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework

Mar 25, 2026
Via arXiv

Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance

Mar 21, 2026
Via arXiv

Few-Step Diffusion Sampling Through Instance-Aware Discretizations

Mar 18, 2026
Via arXiv

FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning

Mar 12, 2026
Via arXiv